A New Approach to Parallel Sparse Cholesky Factorization on Distributed Memory Parallel Computers Mounir Hahad, Jocelyne Erhel, Thierry Priol

نویسندگان

  • Mounir HAHAD
  • Jocelyne ERHEL
  • Thierry PRIOL
چکیده

Nowadays, programming distributed memory parallel computers (DMPCs) evokes the \no pain, no gain" idea. That is, for a given problem to be solved in parallel, the message passing programming model involves distributing the data and the computations among the processors. While this can be easily feasible for well structured problems, it can become fairly hard on unstructured ones, like sparse matrix computations. In this paper, we consider a relatively new approach to implementing the Cholesky factorization on a DMPC running a shared virtual memory (SVM). The abstraction of a shared memory on top of a distributed memory allows us to introduce a large-grain factorization algorithm, synchronized with events. Several scheduling strategies are compared, and experiments conducted so far show that this approach can provide the power of DMPCs and the ease of programming with shared variables. Une nouvelle approche pour la factorisation de Cholesky de matrices creuses sur les machines parall eles a m emoire distribu ee R esum e : L'exploitation eecace de machines parall eles a m emoire distri-bu ee n ecessite des eeorts importants de la part des utilisateurs. En eeet, la mise en uvre d'applications parall eles dans le mod ele de programmation par envoi de messages requiert une distribution des donn ees ainsi que des calculs sur les dii erents processeurs. Cette t^ ache peut se r ev eler complexe sur des pro-bl emes irr eguliers tels que les calcul sur matrices creuses. Dans cet article, nous consid erons une nouvelle approche de la factorisation de Cholesky de matrices creuses sur les machines a m emoire distribu ee dot ees d'une m emoire virtuelle partag ee. L'abstraction d'une m emoire partag ee au dessus d'une m emoire physique distribu ee nous permet d'introduire un algorithme de factorisation a grain large synchronis e par ev enements. Plusieurs strat egies d'ordonnancement sont compar ees. Les r esultats actuels montrent que nous pouvons aboutir a une utili-sation eecace des machines parall eles a m emoire distribu ee, tout en conservant l'avantage d'une programmation ais ee gr^ ace a la communication par variables partag ees.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Irregular Loop Patterns Compilation on Distributed Shared Memory Multiprocessors

This paper addresses irregular loops compilation on Distributed Memory Parallel Computers (DMPCs) that provide a Shared Virtual Memory. Runtime techniques are introduced to distribute irregular loops so that page movements are reduced. Experimental results for the KSR1 are presented.

متن کامل

A New Approach to Parallel Sparse Cholesky Factorization on Distributed Memory Parallel Computers

Nowadays, programming distributed memory parallel computers (DMPCs) evokes the \no pain, no gain" idea. That is, for a given problem to be solved in parallel, the message passing programming model involves distributing the data and the computations among the processors. While this can be easily feasible for well structured problems, it can become fairly hard on unstructured ones, like sparse ma...

متن کامل

Extended Abstract: "No-Compile-Time Knowledge" Distribution of Finite Element Computations on Multiprocessors

This paper addresses partitioning and scheduling of irregular loops arising in finite element computations on unstructured meshes. Target computers are Distributed Memory Parallel Computers that provide a global address space. We introduce the concept of “)conditioned Iterations Loop” which distributes the iterations dynamically according to a runtime condition. This technique is improved by a ...

متن کامل

Parallel Sparse Matrix by Vector Multiplication using a Shared Virtual Memory Environment

Many iterative schemes in scientiic applications require the multiplication of a sparse matrix by a vector. This kernel has been mainly studied on vector processors and shared-memory parallel computers. In this paper, we address the implementation issues when using a shared virtual memory system on a distributed memory parallel computer. We study in details the impact of loop distribution schem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993